Evolutionary Distances in the Twilight Zone—A Rational Kernel Approach

Authors

  • Roland F. Schwarz
  • William Fletcher
  • Frank Förster
  • Benjamin Merget
  • Matthias Wolf
  • Jörg Schultz
  • Florian Markowetz
Abstract

Phylogenetic tree reconstruction is traditionally based on multiple sequence alignments (MSAs) and heavily depends on the validity of this information bottleneck. With increasing sequence divergence, the quality of MSAs decays quickly. Alignment-free methods, on the other hand, are based on abstract string comparisons and avoid potential alignment problems. However, in general they are not biologically motivated and ignore our knowledge about the evolution of sequences. Thus, it is still a major open question how to define an evolutionary distance metric between divergent sequences that makes use of indel information and known substitution models without the need for a multiple alignment. Here we propose a new evolutionary distance metric to close this gap. It uses finite-state transducers to create a biologically motivated similarity score which models substitutions and indels, and does not depend on a multiple sequence alignment. The sequence similarity score is defined in analogy to pairwise alignments and additionally has the positive semi-definite property. We describe its derivation and show in simulation studies and real-world examples that it is more accurate in reconstructing phylogenies than competing methods. The result is a new and accurate way of determining evolutionary distances in and beyond the twilight zone of sequence alignments that is suitable for large datasets.
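The abstract notes that the similarity score is positive semi-definite. One general consequence is that any such kernel k induces a distance through d(x, y) = sqrt(k(x, x) + k(y, y) - 2 k(x, y)). The sketch below illustrates only that conversion. It substitutes a simple k-mer spectrum kernel for the transducer-based rational kernel of the paper, which is not reproduced here. The function names and the choice k = 2 are assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq, k=2):
    """Count all overlapping k-mers in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k=2):
    """Stand-in similarity: inner product of k-mer count vectors.

    This is NOT the paper's transducer-based kernel; it is used here only
    because it is a simple, well-known positive semi-definite string kernel.
    """
    ca, cb = kmer_counts(a, k), kmer_counts(b, k)
    return sum(ca[m] * cb[m] for m in ca)


def kernel_distance(a, b, k=2):
    """Distance induced by a positive semi-definite kernel."""
    squared = (spectrum_kernel(a, a, k)
               + spectrum_kernel(b, b, k)
               - 2 * spectrum_kernel(a, b, k))
    # Guard against tiny negative values before taking the square root.
    return sqrt(max(squared, 0.0))


if __name__ == "__main__":
    seqs = ["ACGT", "ACGA", "TTTT"]
    for x in seqs:
        print([round(kernel_distance(x, y), 3) for y in seqs])
```

A pairwise matrix built this way can be passed to a distance-based tree-building method. No multiple sequence alignment is needed at any step, which is the property the abstract emphasizes.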

Similar resources

A new approach on studying the stability of evolutionary game dynamics for financial systems

Financial market modeling and prediction is a difficult problem, and drastic price changes cause nonlinear dynamics that make price prediction one of the most challenging tasks for economists. Since markets have always been interesting for traders, many traders with various beliefs are highly active in a market. The competition between two agents of traders, namely trend follo...

Using Metaheuristic Algorithms Combined with Clustering Approach to Solve a Sustainable Waste Collection Problem

Sustainability is a monumental issue that should be considered in designing a logistics system. In order to incorporate sustainability concepts in our study, a waste collection problem with economic, environmental, and social objective functions was addressed. The first objective function minimized overall costs of the system, including establishment of depots and treatment facilities. Addressi...

A Geometry Preserving Kernel over Riemannian Manifolds

The kernel trick and projection to tangent spaces are two choices for linearizing the data points lying on Riemannian manifolds. These approaches are used to provide the prerequisites for applying standard machine learning methods on Riemannian manifolds. Classical kernels implicitly project data to high dimensional feature space without considering the intrinsic geometry of data points. ...

Use of structural phylogenetic networks for classification of the ferritin-like superfamily.

In the postgenomic era, bioinformatic analysis of sequence similarity is an immensely powerful tool to gain insight into evolution and protein function. Over long evolutionary distances, however, sequence-based methods fail as the similarities become too low for phylogenetic analysis. Macromolecular structure generally appears better conserved than sequence, but clear models for how structure e...

Analysis of OPEC's Behavior Using an Evolutionary Game Theory Approach

This paper implements an approach to examine economic problems in which rational agents interact in dynamic markets. We use evolutionary game theory and agent-based modeling in tandem as a means to address intertemporal problems that display evolutionary attributes. This study examines the behavior of the Organization of Petroleum Exporting Countries (OPEC) in the global oil markets during the ...

Journal title:

Volume 5, Issue

Pages -

Publication date: 2010